1 AAAAAGAAAG GAAGAAAATG GAAATACAAC AAACACACCG CAAAATCAAT
51 CGCCCTCTGG TTTCTCTCGC TTTAGTAGGA GCATTAGTCA GCATCACACC 
101 GCAACAAAGT CATGCCGCCT TTTTCACAAC CGTGATCATT CCAGCCATTG 
151 TTGGGGGTAT CGCTACAGGC ACCGCTGTAG GAACGGTCTC AGGGCTTCTT 
201 AGCTGGGGGC TCAAACAAGC CGAAGAAGCC AATAAAACCC CAGATAAACC 
251 CGATAAAGTT TGGCGCATTC AAGCAGGAAA AGGCTTTAAT GAATTCCCTA 
301 ACAAGGAATA CGACTTATAC AGATCCCTTT TATCCAGTAA GATTGATGGA 
351 GGTTGGGATT GGGGGAATGC CGCTAGGCAT TATTGGGTCA AAGGCGGGCA 
401 ACAGAATAAG CTTGAAGTGG ATATGAAAGA CGCTGTAGGG ACTTATACCT 
451 TATCAGGGCT TAGAAACTTT ACTGGTGGGG ATTTAGATGT CAATATGCAA 
501 AAAGCCACTT TACGCTTGGG CCAATTCAAT GGCAATTCTT TTACAAGCTA 
551 TAAGGATAGT GCTGATCGCA CCACGAGAGT GATTTCAACG CTAAAAATAT 
601 CTCAATTGAT AATTTTGCAG AAATCAACAA CTCGTGTGGG TTCTGGAGCC 
651 GGGAGGAAAG CCAGCTCTAC GGTTTTGACT TTGCAAGCTT CAGAAGGGAT 
701 CACTAGCGAT AAAAACGCTG AAATTTCTCT TTATGATGGT GCCACGCTCA 
751 ATTTGGCTTC AAGCAGCGTT AAATTAATGG GTAATGTGTG GATGGGCCGT 
801 TTGCAATACG TGGGAGCGTA TTTGGCCCCT TCATACAGCA CGATAAACAC 
851 TTCAAAAGTA ACAGGGGAAG TGAATTTTAA CCACCTCACT GTTGGCGATA 
901 AAAACGCCGC TCAAGCGGGC ATTATCGCTA ATAAAAAGAC TAATATTGGC 
951 ACACTGGATT TGTGGCAAAG CGCCGGGTTA AACATTATCG CTCCTCCAGA 
1001 AGGTGGCTAT AAGGATAAAC CCAATAATAC CCCTTCTCAA AGTGGTGCTA
1051 AAAACGACAA AAATGAAAGC GCTAAAAACG ACAAACAAGA GAGCAGTCAA 
1101 AATAATAGTA ACACTCAGGT CATTAACCCA CCCAATAGTG CGCAAAAAAC 
1151 AGAAGTTCAA CCCACGCAAG TCATTGATGG GCCTTTTGCG GGCGGCAAAG 
1201 ACACGGTTGT CAATATCAAC CGCATCAACA CTAACGCTGA TGGCACGATT 
1251 AGAGTGGGAG GGTTTAAAGC TTCTCTTACC ACCAATGCGG CTCATTTGCA 
1301 TATCGGCAAA GGCGGTGTCA ATCTGTCCAA TCAAGCGAGC GGGCGCTCTC 
1351 TTATAGTGGA AAATCTAACT GGGAATATCA CCGTTGATGG GCCTTTAAGA
1401 GTGAATAATC AAGTGGGTGG CTATGCTTTG GCAGGATCAA GCGCGAATTT 
1451 TGAGTTTAAG GCTGGTACGG ATACCAAAAA CGGCACAGCC ACTTTTAATA 
1501 ACGATATTAG TCTGGGAAGA TTTGTGAATT TAAAGGTGGA TGCTCATACA 
1551 GCTAATTTTA AAGGTATTGA TACGGGTAAT GGTGGTTTCA ACACCTTAGA 
1601 TTTTAGTGGC GTTACAGACA AAGTCAATAT CAACAAGCTC ATTACGGCTT 
1651 CCACTAATGT GGCCGTTAAA AACTTCAACA TTAATGAATT GATTGTTAAA 
1701 ACCAATGGGA TAAGTGTGGG GGAATATACT CATTTTAGCG AAGATATAGG 
1751 CAGTCAATCG CGCATCAATA CCGTGCGTTT GGAAACTGGC ACTAGGTCAC 
1801 TTTTCTCTGG GGGTGTTAAA TTTAAAGGTG GCGAAAAATT GGTTATAGAT 
1851 GAGTTTTACT ATAGCCCTTG GAATTATTTT GACGCTAGAA ATATTAAAAA 
1901 TGTTGAAATC ACCAATAAAC TTGCTTTTGG ACCTCAAGGA AGTCCTTGGG 
1951 GCACATCAAA ACTTATGTTC AATAATCTAA CCCTAGGTCA AAATGCGGTC 
2001 ATGGATTATA GCCAATTTTT AAATTTAACC ATTCAAGGGG ATTTCATCAA 
2051 CAATCAAGGC ACTATCAACT ATCTGGTCCG AGGTGGGAAA GTGGCAACCT 
2101 TAAGCGTAGG CAATGCAGCA GCTATGATGT TTAATAATGA TATAGACAGC 
2151 GCGACCGGAT TTTACAAACC GCTCATCAAG ATTAACAGCG CTCAAGATCT 
2201 CATTAAAAAT ACAGAACATG TTTTATTGAA AGCGAAAATC ATTGGTTATG 
2251 GTAATGTTTC TACAGGTACC AATGGCATTA GTAATGTTAA TCTAGAAGAG 
2301 CAATTCAAAG AGCGCCTAGC CCTTTATAAC AACAATAACC GCATGGATAC 
2351 TTGTGTGGTG CGAAATACTG ATGACATTAA AGCATGCGGT ATGGCTATCG 
2401 GCGATCAAAG CATGGTGAAC AACCCTGACA ATTACAAGTA TCTTATCGGT 
2451 AAGGCATGGA AAAATATAGG GATCAGCAAA ACAGCTAATG GCTCTAAAAT 
2501 TTCGGTGTAT TATTTAGGCA ATTCTACGCC TACTGAGAAT GGTGGCAATA 
2551 CCACAAATTT ACCCACAAAC AGCACTAGCA ATGCACGTTC TGCCAACAAC 
2601 GCCCTTGCAC AAAACGCTCC TTTCGCTCAA CCTAGTGCTA CTCCTAATTT 
2651 AGTCGCTATC AATCAGCATG ATTTTGGCAC TATTGAAAGC GTGTTTGAAT 

FIG. 1B 
1 MEIQQTHRKI NRPLVSLALV GALVSITPQQ SHAAFFTTVI IPAIVGGIAT
51 GTAVGTVSGL LSWGLKQAEE ANKTPDKPDK VWRIQAGKGF NEFPNKEYDL 
101 YRSLLSSKID GGWDWGNAAR HYWVKGGQQN KLEVDMKDAV GTYTLSGLRN 
151 FTGGDLDVNM QKATLRLGQF NGNSFTSYKD SADRTTRVIS TLKISQLIIL
201 QKSTTRVGSG AGRKASSTVL TLQASEGITS DKNAEISLYD GATLNLASSS 
251 VKLMGNVWMG RLQYVGAYLA PSYSTINTSK VTGEVNFNHL TVGDKNAAQA 
301 GIIANKKTNI GTLDLWQSAG LNIIAPPEGG YKDKPNNTPS QSGAKNDKNE
351 SAKNDKQESS QNNSNTQVIN PPNSAQKTEV QPTQVIDGPF AGGKDTVVNI
401 NRINTNADGT IRVGGFKASL TTNAAHLHIG KGGVNLSNQA SGRSLIVENL 
451 TGNITVDGPL RVNNQVGGYA LAGSSANFEF KAGTDTKNGT ATFNNDISLG 
501 RFVNLKVDAH TANFKGIDTG NGGFNTLDFS GVTDKVNINK LITASTNVAV 
551 KNFNINELIV KTNGISVGEY THFSEDIGSQ SRINTVRLET GTRSLFSGGV 
601 KFKGGEKLVI DEFYYSPWNY FDARNIKNVE ITNKLAFGPQ GSPWGTSKLM 
651 FNNLTLGQNA VMDYSQFLNL TIQGDFINNQ GTINYLVRGG KVATLSVGNA 
701 AAMMFNNDID SATGFYKPLI KINSAQDLIK NTEHVLLKAK IIGYGNVSTG
751 TNGISNVNLE EQFKERLALY NNNNRMDTCV VRNTDDIKAC GMAIGDQSMV



801 NNPDNYKYLI GKAWKNIGIS KTANGSKISV YYLGNSTPTE NGGNTTNLPT 

851 NTTSNARSAN NALAQNAPFA QPSATPNLVA INQHDFGTIE SVFELANRSK 

901 DIDTLYANSG AQGRDLLQTL LIDSHDAGYA RKMIDATSAN EITKQLNTAT 

951 TTLNNIASLE HKTSGLQTLS LSNAMILNSR LVNLSRRHTN HIDSFAKRLQ 

1001 ALKDQKFASL ESAAEVLYQF APKYEKPTNV WANAIGGTSL NNGSNASLYG 

1051 TSAGVDAYLN GQVEAIVGGF GSYGYSSFNN RANSLNSGAN NTNFGVYSRI 

1101 LTNQHEFDFE AQGALGSDQS SLNFKSALLQ DLNQSYHYLA YSAATRASYG 

1151 YDFAFFRNAL VLKPSVGVSY NHLGSTNFKS NSTNQVALKN GSSSQHLFNA 

1201 SANVEARYYY GDTSYFYMNA GVLQEFAHVG SNNAASLNTF KVNAARNPLN 

1251 THARVMMGGE LKLAKEVFLN LGVVYLHNLI SNIGHFASNL GMRYSF 
CTCCATTTTAAGCAACTCCATAGACCACTAAAGAAACTTTTTTTGAGGCTATCTTTGAAA
GCTTAATTATACATGCTATAGTAAGCATGACACACAAACCAAACTATTTTTAGAACGCTT 
TCAAAAAGATTCATTTCTTATTTCTTGTTCTTATTAAAGTTCTTTCATTTTAGCAAATTT 
CTTTTTTCAATATTAATAATGATTAATGAAAAAAAAAAAAAATGCTTGATATTGTTGTAT 
TTGACACTAACAAGATACCGATAGGTATGAAACTAGGTATAGTA AGGAG AAACAATGACT 

M T 

AATAATCTTCAAGTAGCTTTTCTTAAAGTTGATAACGCTGTCGCTTCATACGATCCTGAT 
23 NNLQVAFLKVDNAVASYDPD

CAATTAAGGGAAGAATACTCCAATAAAGCGATCAAAAATCCTACCAAAAAGAATCAGTAT 
63 QLREEYSNKAIKNPTKKNQY

GAATCTTCCACAAAGAGCTTTCAGAAATTTGGGGATCAGCGTTACCGAATTTTCACAAGT 
103 ESSTKSFQKFGDQRYRIFTS 

GAAAATATCATACAACCCCCTATCCTTGATGATAAAGAGAAAGCGGAGTTTTTGAAATCT 
143 ENIIQPPILDDKEKAEFLKS

ATGGGCGTGTTTGATGAGTCCTTGAAAGAAAGGCAAGAAGCAGAAAAAAATGGAGAGCCT 
183 MGVFDESLKERQEAEKNGEP 

GATGTCAAAGAAGCAATCAATCAAGAACCAGTTCCCCATGTCCAACCAGATATAGCCACT 
223 DVKEAINQEPVPHVQPDIAT

AATTTTTCTAAATTCACTCTTGGCGATATGGAAATGTTAGATGTTGAGGGAGTCGCTGAC 
263 NFSKFTLGDMEMLDVEGVAD 

TTAATGGGGAGTCATAATGGCATAGAACCTGAAAAAGTTTCATTGTTGTATGGGGGCAAT 
303 LMGSHNGIEPEKVSLLYGGN

AACAATGTGGCTACAATAATTAATGTGCATATGAAAAACGGCAGTGGCTTAGTCATAGCA 
343 NNVATIINVHMKNGSGLVIA

GGCTCACAACGAGCATTAAGTCAAGAAGAGATCCAAAACAAAATAGATTTCATGGAA TTT 
383 GSQRALSQEEIQNKIDFMEF

ACTGAGATTAAAGATTTCCAAAAAGACTCTAAGGCTTATTTAGACGCCCTAGGGAATGAT 
423 TEIKDFQKDSKAYLDALGND 

AATGGGGATTTGAGCTACACTCTCAAAGATTATGGGAAAAAAGCAGATAAAGCTTTAGAT 
463 NGDLSYTLKDYGKKADKALD 

TATTCTAATTTCAAATACACCAACGCCTCCAAGAATCCCAATAAGGGTGTAGGCGTTACG 
ATCTGTCCTATTGATTTGTTTTCCATTTTGTTTCCCATGTGGATCTTGTGGATCACAAAC 120
CATGTGCTCACCTTGACTAACCATTTCTCCAACCATACTTTAGCGTTGCATTTGATTTCT 240 
TTGTTAATTGTGGGTAAAAATGTGAATCGTCCTAGCCTTTAGACGCCTGCAACGATCGGG 360 
AATGAGAATGTTCAAAGACATGAATTGACTACTCAAGCGTGTAGCGATTTTTAGCAGTCT 480 
AACGAAACCATTGACCAACAACCACAAACCGAAGCGGCTTTTAACCCGCAGCAATTTATC 600 
NETIDQQPQTEAAFNPQQFI
CAAAAACCAATCGTTGATAAGAACGATAGGGATAACAGGCAAGCTTTTGAAGGAATCTCG 720 
QKPIVDKNDRDNRQAFEGIS 
TTTTCAGACTTTATCAATAAGAGCAATGATTTAATCAACAAAGACAATCTCATTGATGTA 840 
FSDFINKSNDLINKDNLIDV 
TGGGTGTCCCATCAAAACGATCCGTCTAAAATCAACACCCGATCGATCCGAAATTTTATG 960 
WVSHQNDPSKINTRSIRNFM 
GCCAAACAATCTTTTGCAGGAATCATTATAGGGAATCAAATCCGAACGGATCAAAAGTTC 1080 
AKQSFAGIIIGNQIRTDQKF
ACTGGTGGGGATTGGTTGGATATTTTTCTCTCATTTATATTTGACAAAAAACAATCTTCT 1200 
TGGDWLDIFLSFIFDKKQSS
ACCACCACCGACATACAAGGCTTACCGCCTGAAGCTAGAGATTTACTTGATGAAAGGGGT 1320 
TTTDIQGLPPEARDLLDERG 
ATTGATCCCAATTACAAGTTCAATCAATTATTGATTCACAATAACGCTCTGTCTTCTGTG 1440 
IDPNYKFNQLLIHNNALSSV
GGTGGTCCTGGAGCTAGGCATGATTGGAACGCCACCGTTGGTTATAAAGACCAACAAGGC 1560 
GGPGARHDWNATVGYKDQQG 
GGTGGTGAGAAAGGGATTAACAACCCTAGTTTTTATCTCTACAAAGAAGACCAACTCACA 1680 
GGEKGINNPSFYLYKEDQLT 
CTTGCACAAAATAATGCTAAATTAGACAACTTGAGCGAGAAAGAGAAGGAAAAATTCCGA 1800 
LAQNNAKLDNLSEKEKEKFR 
CGTATTGCTTTTGTTTCTAAAAAAGACACAAAACATTCAGCTTTAATTACTGAGTTTGGT 1920 
RIAFVSKKDTKHSALITEFG
AGGGAGAAAAATGTTACTCTTCAAGGTAGCCTAAAACATGATGGCGTGATGTTTGTTGAT 2040
REKNVTLQGSLKHDGVMFVD
AATGGCGTTTCCCATTTAGAAGTAGGCTTTAACAAGGTAGCTATCTTTAATTTGCCTGAT 2160 

FIG. 4B 
503 YSNFKYTNASKNPNKGVGVT

TTAAATAATCTCGCTATCACTAGTTTCGTAAGGCGGAATTTAGAGGATAAACTAACCACT
543 LNNLAITSFVRRNLEDKLTT

GAATTGGTTGGAAAAACTTTAAACTTGAATAAAGCTGTAGC1GACGCTAAAAACACAGGC 
583 ELVGKTLNFNKAVADAKNTG

CATTTAGAGAAAGAAGTAGAGAAAAAATTGGAGAGCAAAAGCGGCAACAAAAATAAAATG 
623 HLEKEVEKKLESKSGNKNKM 

GCTAATAGAGACGCAAGAGCAATCGCTTACGCTCAGAATCTTAAAGGCATCAAAAGGGAA 
663 ANRDARAIAYAQNLKGIKRE

GAATTCAAAAATGGCAAAAATAAGGATTTCAGCAA GGCAGAAGAAACACTAAAAGCCCTT
703 EFKNGKNKDFSKAEETLKAL

AATGCAGCTTTGAA TGAATTCAAAAATGGCAAAAATAAGGATTTCAGCAA GGTAACGCAA 
743 NAALNEFKNGKNKDFSKVTQ

AAAGTTGATAATCTCAATCAAGCGGTATCAGTGGCTAAAGCAACGGGTGATTTCAGTAGG 
783 KVDNLNQAVSVAKATGDFSR

CAAAAAAATGAAAGTCTCAATGCTAGAAAAAAATCTGAAATATATCAATCCGTTAAGAAT 
823 QKNESLNARKKSEIYQSVKN

AAAAACTTTTCGGACATCAAGAAAGAGTTGAATGCAAAACTTGGAAATTT CAATAACA AT 
863 KNFSDIKKELNAKLGNFNNN

CAAGCAGCTAGCCTTGA AGAACCCATTTACGCT CAAGTTGCTAAAAAGGTAAATGCAAAA
903 QAASLEEPIYAQVAKKVNAK

CCTTTGAAAAGGCATGATAAAGTTGATGATCTCAGTAAGGT AGGGCTTTCAAGGAATCAA 
943 PLKRHDKVDDLSKVGLSRNQ

TTTGGCAATCTAGAGCAAACGATAGACAAGCTCAAAGATTCTACAAAACACAATCCCATG 
983 FGNLEQTIDKLKDSTKHNPM

TACGCTACTAACAGCCACATACGCATTAATAGCAATATCAAAAATGGAGCAATCAATGAA 



FIG. 4C 






Docket No.: CHIR-031S 

App No.: 09/921,157 Filed: August 2, 2001 

Title: HELICOBACTER PYLORI CYTOTOXIN PROTEIN 
USEFUL FOR VACCINES AND DIAGNOSTICS 
Inventors: Covacci, Et Al. 

Attorney: Felicity E. Groth Phone: (215) 568-3100 

Sheet 9 of 14 






NGVSHLEVGFNKVAIFNLPD 

AAAGGATTGTCCCCACAAGAAGCTAATAAGCTTATCAAAGATTTTTTGAGCAGCAACAAA

KGLSPQEANKLIKDFLSSNK

AATTATGATGAAGTGAAAAAAGCTCAGAAAGATCTTGAAAAATCTCTAAGGAAACGAGAG 

NYDEVKKAQKDLEKSLRKRE

GAAGCAAAAGCTCAAGCTAACAGCCAAAAAGATGAGATTTTTGCGTTGATCAATAAAGAG 

EAKAQANSQKDEIFALINKE 

TTGTCTGATAAACTTGAAAATGTCAACAAGAATTTGAAAGACTTTGATAAATCTTTTGAT 

LSDKLENVNKNLKDFDKSFD 

AAAGGTTCGGTGAAAGATTTAGGTATCAATCCAGAATGGATTTCAAAAGTTGAAAACCTT 

KGSVKDLGINPEWISKVENL

GCAAAAAGCGACCTTGAAAATTCCGTTAAAGATGTGATCATCAATCAAAAGGTAACGGAT 

AKSDLENSVKDVIINQKVTD

GTAGAGCAAGCGTTAGCCGATCTCAAAAATTTCTCAAAGGAGCAATTGGCCCAACAAGCT 

VEQALADLKNFSKEQLAQQA 

GGTGTGAATGGAACCCTAGTCGGTAATGGGTTATCTCAAGCAGAAGCCACAACTCTTTCT 

GVNGTLVGNGLSQAEATTLS

AACAATAA TGGACTCAAAAA CGAACCCATTTATGC TAAAGTTAATAAAAAGAAAGCAGGG 

NNNGLKNEPIYAKVNKKKAG



ATTGACCGACTCAATCAAATAGCAAGTGGTTTGGGTGTTGTAGGGCAAGCAGCGGGCTTC 

IDRLNQIASGLGVVGQAAGF

GAATTGGCTCAGAAAATTGACAATCTCAATCAAGCGGTATCAGAAGCTAAAGCAGGTTTT 

ELAQKIDNLNQAVSEAKAGF

AATCTATGGGTTGAAAGTGCAAAAAAAGTACCTGCTAGTTTGTCAGCGAAACTAGACAAT 

NLWVESAKKVPASLSAKLDN 

AAAGCGACCGGCATGCTAACGCAAAAAAACCCTGAGTGGCTCAAGCTCGTGAATGATAAG 



Nucleotide positions at row ends: 2280, 2400, 2520, 2640, 2760, 2880, 3000, 3120, 3240, 3360, 3480, 3600, 3720



FIG. 4D 






1023 YATNSHIRINSNIKNGAIN
ATAGTTGCGCATAATGTAGGAAGCGTTCCTTTGTCAGAGTATGATAAAATTGGCTTC

1063 IVAHNVGSVPLSEYDKIGF 
GTAAAAGACACTAATTCTGGCTTTACGCAATTTTTAACCAATGCATTTTCTACAGCA 

1103 VKDTNSGFTQFLTNAFSTA 
GGTTTCCAAAAATCTTAAAGGATTAAGGAATACCAAAAACGCAAAAACC ACCCCTTG 

1143 GFQKS

TGAATGCTACCAATTCATGGTATCATATCCCCATACATTCGTATCTAGCGTAGGAAG 

AACTCTGTAAAATCCCTATTATAGGGACACAGAGTGAGAACCAAACTCTCCCTACGG 

GACAGACACTAACGAAAGGCTTTGTTCTTTAAAGTCTGCATGGATATTTCCTACCCC 

CGAAAATTAATTAAGGGTTATAAAGAGAGCATAAACTAGAAAAAACAAGTAGCTATA 

GAAAAATCAGAAAAACCATAGGAATTATCACACCTTATAATGCCCAAAAAAGACGCT 

ATGCCTTTCAAGGTGAAGAGGCAGATATTATTATTTATTCCACCGTGAAAACTTGTG 

ATCTCATTTTTGTGGGTAAAAAGTCTTTCTTTGAGAATTTATGAAGCGATGAGAAGA 

CATTCTTCGCTTCAAAACGCTTTCATAAATCTCTCTAAAGCGCTTTATAATCAACAC 

TTATTAGCGTTACAATTTGAGCCATTCTTTAGCTTGTTTTTCTAGCCAGATCACATC 

CTGCAAATATCCTACAATAGCATCGCCCGAATGGATGAGTAGGGGGGGTGTTGAAAG 

TAAAATAATCACTTCGGGAAAATCTTTAAGGGAGTGAAATAATAACGCATGCAAGTT 

TGCGAAACATTCAAATAGCCTTGTTGTTTCAGGGCATTGTCATAAGCGTTGGATTGG 

GCTAAAATGCTTGGCTCAATCACGCCCACAATAGGGATTTTGGAATGCTTTTGCATC 

TTGAAAAAATCCAAAGCCTCTAAGCCAAATTGCTTGATCGTAGTGGGGTCTTTAGTG 

AGGCTTTTTAAAACGCTAAACCCTCCCACACCGCTATCAAAAACGCCTATTTTCATG 

TCTTCATTGTCCTTAGTTTGTTGCATTTTAGAATAGACAAAGCTT 5925 
EKATGMLTQKNPEWLKLVNDK

AACCAGAAGAATATGAAAGATTATTCTGATTCGTTCAAGTTTTCCACCAAGTTGAACAATGCT 3840
NQKNMKDYSDSFKFSTKLNNA 

TCTTATTACTGCTTGGCGAGAGAAAATGCGGAGCATGGAATCAAGAACGTTAATACAAAAGGT 3960 
SYYCLARENAEHGIKNVNTKG

£IAAAAG£GAGGGGITTTTTAATACTCCTTAGCAGAAATCCCAATCGTCTTTAGTATTTGGGA 4080 

TGTGCAAAGTTACGCCTTTGGAGATATGATGTGTGAGACCTGTAGGGAATGCGTTGGAGCTCA 4200 

GCAACATCAGCCTAGGAAGCCCAATCGTCTTTAGCGGTTGGGCACTTCACCTTAAAATATCCC 4320 

AAAAAGACTTAACCCTTTGCTTAAAATTAAGTTTGATTGTGCTAGTGGGTTCGTGCTATAGTG 4440 

ACAAAGATCAAGTTCAAAAAATCATAGAGCTTTTAGAGCAAATTGATCGCGCTCTTAACCAAA 4560 

TGCGATCAGAAGTGGAAAAATACGGCTTCAAGAATTTTGATGAGCTCAAAATAGACACTGTGG 4680 

GTAATCTTTCTTTCTTGCTAGATTCTAAACGCTTGAATGTGGCTATTTCTAGGGCAAAAGAAA 4800 

ATATCTTTAGCGCTATTTTGCAAGTCTGTAGATAGGTAATCTTTTCCAAAGATAATCATTAGA 4920 

AATACCCTTATAGTGTGAGCTATAGCCCCTTTTTGGGAATTGAGTTATTTTGACTTTAAATTT 5040 

GCCGCTCGCATGAAATTCCACTTTAGGGAATGCGTGTGCATTTTTTTTAAGGGCGTATTTTTG 5160 

GGCAAAATGCTCCATAAAATAGCCCTCAATTTTTTGAGCGATTAAGGGAAAATGCGTGCAACC 5280 

TCTAACAATTCGCCCTCTAAAATACTTTCTTCAATCAAAGGCACAAAAAGAGAAGTGGCTAAA 5400 

ATCGTCGCTTTTGTCCCTAGCACTAAAATAGGGGCGTTTTTATCTTTTACTTGTCGCTTGATC 5520 

TCTTCTAAAGCTAGAGCGCTCGCTGTGTTGCATGCCACAATCAATAATTCAATCTGGTGCGGT 5640 

CCATAAGGCACTCTAGCCGTATCGCCATAATAGATGATTTCATCAAATAATTGCGCTTTTAAA 5760 

ACACTTTTTTAATTTAATGGGATTAATTAGGGATTTTATTTTTCATTCATTAAGTTTAAAAAT 5880 
10 30 50

AAGCTTGCTGTCATGATCACAAAAAACACTAAAAAACATTATTAT TAAGGA TACAAAATG

M 

70 90 110 

GCAAAAGAAATCAAATTTTCAGATAGTGCGAGAAACCTTTTATTTGAAGGCGTGAGGCAA 
AKEIKFSDSARNLLFEGVRQ 

130 150 170 

CTCCATGACGCTGTCAAAGTAACCATGGGGCCAAGAGGCAGGAATGTATTGATCCAAAAA 
LHDAVKVTMGPRGRNVLIQK

190 210 230 

AGCTATGGCGCTCCAAGCATCACCAAAGACGGCGTGAGCGTGGCTAAAGAGATTGAATTA 
SYGAPSITKDGVSVAKEIEL

250 270 290 

AGTTGCCCAGTAGCTAACATGGGCGCTCAACTCGTTAAAGAAGTAGCGAGCAAAACCGCT 
SCPVANMGAQLVKEVASKTA 

310 330 350 

GATGCTGCCGGCGATGGCACGACCACAGCGACCGTGCTAGCTTATAGCATTTTTAAAGAA 
DAAGDGTTTATVLAYSIFKE

370 390 410

GGTTTGAGGAATATCACGGCTGGGGCTAACCCTATTGAAGTGAAACGAGGCATGGATAAA 
GLRNITAGANPIEVKRGMDK

430 450 470 

GCTGCTGAAGCGATCATTAATGAGCTTAAAAAAGCGAGCAAAAAAGTAGGCGGTAAAGAA 
AAEAIINELKKASKKVGGKE

490 510 530 

GAAATCACCCAAGTGGCGACCATTTCTGCAAACTCCGATCACAATATCGGGAAACTCATC 
EITQVATISANSDHNIGKLI

550 570 590 

GCTGACGCTATGGAAAAAGTGGGTAAAGACGGCGTGATCACCGTTGAGGAAGCTAAGGGC 
ADAMEKVGKDGVITVEEAKG 

610 630 650 

ATTGAAGATGAATTGGATGTCGTAGAAGGCATGCAATTTGATAGAGGCTACCTCTCCCCT 
IEDELDVVEGMQFDRGYLSP
670 690 710

TATTTTGTAACGAACGCTGAGAAAATGACCGCTCAATTGGATAATGCTTACATCCTTTTA
YFVTNAEKMTAQLDNAYILL 

730 750 770 

ACGGATAAAAAAATCTCTAGCATGAAAGACATTCTCCCGCTACTAGAAAAAACCATGAAA 
TDKKISSMKDILPLLEKTMK 



790 810 HindIII

GAGGGCAAACCGCTTTTAATCATCGCTGAAGACATTGAGGGC GAAGCTT TAACGACTCTA 
EGKPLLIIAEDIEGEALTTL

850 870 890 

GTGGTGAATAAATTAAGAGGCGTGTTGAATATCGCAGCGGTTAAAGCTCCAGGCTTTGGG 
VVNKLRGVLNIAAVKAPGFG 

910 930 950 

GACAGAAGAAAAGAAATGCTCAAAGACATCGCTATTTTAACCGGCGGTCAAGTCATTAGC 
DRRKEMLKDIAILTGGQVIS 

970 990 1010 

GAAGAATTGGGCTTGAGTCTAGAAAACGCTGAAGTGGAGTTTTTAGGCAAAGCTGGAAGG 
EELGLSLENAEVEFLGKAGR

1030 1050 1070 

ATTGTGATTGACAAAGACAACACCACGATCGTAGATGGCAAAGGCCATAGCGATGATGTT
IVIDKDNTTIVDGKGHSDDV 

1090 1110 1130 

AAAGACAGAGTCGCGCAGATCAAAACCCAAATTGCAAGTACGACAAGCGATTATGACAAA 
KDRVAQIKTQIASTTSDYDK

1150 1170 1190 

GAAAAATTGCAAGAAAGATTGGCTAAACTCTCTGGCGGTGTGGCTGTGATTAAAGTGGGC 
EKLQERLAKLSGGVAVIKVG 

1210 1230 1250 

GCTGCGAGTGAAGTGGAAATGAAAGAGAAAAAAGACCGGGTGGATGACGCGTTGAGCGCG 
AASEVEMKEKKDRVDDALSA 

1270 1290 1310 

ACTAAAGCGGCGGTTGAAGAAGGCATTGTGATTGGTGGCGGTGCGGCTCTCATTCGCGCG 
TKAAVEEGIVIGGGAALIRA 
1330 1350 1370

GCTCAAAAAGTGCATTTGAATTTGCACGATGATGAAAAAGTGGGCTATGAAATCATCATG
AQKVHLNLHDDEKVGYEIIM

1390 1410 1430

CGCGCCATTAAAGCCCCATTAGCTCAAATCGCTATCAACGCTGGTTATGATGGCGGTGTG 
RAIKAPLAQIAINAGYDGGV 

1450 1470 1490 

GTCGTGAATGAAGTAGAAAAACACGAAGGGCATTTTGGTTTTAACGCTAGCAATGGCAAG 
VVNEVEKHEGHFGFNASNGK 

1510 1530 1550 

TATGTGGATATGTTTAAAGAAGGCATTATTGACCCCTTAAAAGTAGAAAGGATCGCTCTA 
YVDMFKEGIIDPLKVERIAL

1570 1590 1610 

CAAAATGCGGTTTCGGTTTCAAGCCTGCTTTTAACCACAGAAGCCACCGTGCATGAAATC
QNAVSVSSLLLTTEATVHEI 

1630 1650 1670 

AAAGAAGAAAAAGCGACTCCGGCAATGCCTGATATGGGTGGCATGGGCGGTATGGGAGGC 
KEEKATPAMPDMGGMGGMGG

1690 1710 1730 

ATGGGCGGCATGATGTAAGCCCGCTTGCTTTTTAGTATAATCTGCTTTTAAAATCCCTTC 
M G G M M # 

1750 1770 1790 

TCTAAATCCCCCCCTTTCTAAAATCTCTTTTTTGGGGGGGTGCTTTGATAAAACCGCTCG 

1810 1830 
CTTGTAAAAACATGCAACAAAAAATCTCTGTTAAGCTT 